Human copy number polymorphic genes.

نویسندگان

  • J A Bailey
  • J M Kidd
  • E E Eichler
چکیده

Recent large-scale genomic studies within human populations have identified numerous genomic regions as copy number variant (CNV). As these CNV regions often overlap coding regions of the genome, large lists of potentially copy number polymorphic genes have been produced that are candidates for disease association. Most of the current data regarding normal genic variation, however, has been generated using BAC or SNP microarrays, which lack precision especially with respect to exons. To address this, we assessed 2,790 candidate CNV genes defined from available studies in nine well-characterized HapMap individuals by designing a customized oligonucleotide microarray targeted specifically to exons. Using exon array comparative genomic hybridization (aCGH), we detected 255 (9%) of the candidates as true CNVs including 134 with evidence of variation over the entire gene. Individuals differed in copy number from the control by an average of 100 gene loci. Both partial- and whole-gene CNVs were strongly associated with segmental duplications (55 and 71%, respectively) as well as regions of positive selection. We confirmed 37% of the whole-gene CNVs using the fosmid end sequence pair (ESP) structural variation map for these same individuals. If we modify the end sequence pair mapping strategy to include low-sequence identity ESPs (98-99.5%) and ESPs with an everted orientation, we can capture 82% of the missed genes leading to more complete ascertainment of structural variation within duplicated genes. Our results indicate that segmental duplications are the source of the majority of full-length copy number polymorphic genes, most of the variant genes are organized as tandem duplications, and a significant fraction of these genes will represent paralogs with levels of sequence diversity beyond thresholds of allelic variation. In addition, these data provide a targeted set of CNV genes enriched for regions likely to be associated with human phenotypic differences due to copy number changes and present a source of copy number responsive oligonucleotide probes for future association studies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TFG, a target of chromosome translocations in lymphoma and soft tissue tumors, fuses to GPR128 in healthy individuals.

BACKGROUND The formation of fusion genes plays roles in both oncogenesis and evolution by facilitating the acquisition of novel functions. Here we describe the first example of a human polymorphic in-frame fusion of two unrelated genes associated with a copy number variant. DESIGN AND METHODS Array comparative genomic hybridization was used to identify cryptic oncogenic fusion genes. Fusion g...

متن کامل

Extensive Copy-Number Variation of Young Genes across Stickleback Populations

Duplicate genes emerge as copy-number variations (CNVs) at the population level, and remain copy-number polymorphic until they are fixed or lost. The successful establishment of such structural polymorphisms in the genome plays an important role in evolution by promoting genetic diversity, complexity and innovation. To characterize the early evolutionary stages of duplicate genes and their pote...

متن کامل

P-157: Polymorphic Core Promoter GA-repeats Alter Gene Expression of The Early Embryonic Developmental Genes

Background: We examine the GA-repeat core promoters of MECOM and GABRA3 in human embryonic kidney-293 cell line and show that those GA-repeats have promoter activity,and those different alleles of the repeats can significantly alter gene expression.We propose a novel role for GA-repeat core promoters to regulate gene expression in the genes involved in development and evolution. Materials and M...

متن کامل

Molecular characterization of Theileria parva parasites from South Sudan using the PCR-RFLP approach on antigen genes

  In an attempt to characterize Theileria parva parasites circulating in South Sudan cattle , polymerase chain reaction (PCR)-based assays were carried out using four single copy encoding antigen genes p104, PIM, p150 and p67 in addition to one microsatellite MS321. A total of 20 bovine DNA samples from two locations in South Sudan were included in this study, in addition to two references stra...

متن کامل

The rabbit alpha-like globin gene cluster is polymorphic both in the sizes of BamHI fragments and in the numbers of duplicated sets of genes.

The alpha-like globin gene cluster in rabbits contains embryonic zeta-globin genes, an adult alpha-globin gene, and theta-globin genes of undetermined function. The basic arrangement of genes, deduced from analysis of cloned DNA fragments, is 5'-zeta 0-zeta 1-alpha 1-theta 1-zeta 2-zeta 3-theta 2-3'. However, the pattern of restriction fragments containing zeta- and theta-globin genes varies am...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Cytogenetic and genome research

دوره 123 1-4  شماره 

صفحات  -

تاریخ انتشار 2008